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I. Real Party in interest 

Agilent Technologies, Inc., the assignee of the above referenced patent application is the 
real party in interest. 

II. Related Appeals and Interferences 

There are no related appeals or interferences. 

III. Status of Claims 

Claims 1-13 and 17-20 are pending in this application. 
Claims 14-16 and 21-34 are canceled. 

Claims 1-13 have been rejected under 35 USC § 102(b) as being anticipated by Singh et 
al., "Gene Expression Correlates of Clinical Prostrate Cancer Behavior", Cancer Cell, 
Vol, 1, March 2002. 

Claims 1-10, 12-13 and 17-20 have been rejected under 35 USC § 102(e) as being 
anticipated by Crosby et al, U.S. Patent Publication No. 2003/0190689. 

The rejections of Claims 1-13 and 17-20 are appealed. 

rV. Status of Amendments 

No amendments were filed after the Final Office Action dated January 8, 2007. 



2 



Minor 

Application Serial No. 1 0/82 1 ,829 
Pages 



V. Summary of Claimed Subject Matter 

A, Independent Claim 1 recites a method for rank ordering characteristic signatures 
of properties (stated in paragraph [0006] on page 3 of the specification). The method 
includes forming a plurality of characteristic signatures, which is described, for example 
at paragraph [0055], pages 16-17 of the specification. A characteristic signature may be a 
gene expression response signature (paragraph [0032}, page 8 of the specification), for 
example, of other profile of measurements of a particular characteristic of a target 
material (e.g., gas chromatograph readings, protein abundance analysis measurements, 
mass spectrometry readings, etc., see paragraph [0043}, page 1 1), wherein the 
measurements are taken from multiple locations (samples) along a pathway of the same 
target material contaimng tissue cells of interest and outlying tissues where it is relatively 
certain that none, or insignificant amounts of the cells of interest exist, see paragraph 
[0044], page 1 1 of the specification. A series of samples 108a, 108b, 108n are taken 
along a line 106 which extends through the tissue cells of interest 104 and outlying 
tissues 102 on both sides, or any other trajectory where expected changes in density of 
the tissue cells of interest can be predicted or hypothesized, see paragraph [0044], page 
1 1, of the specification, and Figs. 1-3. 

For each tissue sample 108a, 108b, 108n, a measurement of that tissue sample is 
taken for a particular property or characteristic that is to be characterized by the 
characteristic signature that is presently being formed, see paragraph [0049], page 14 of 
the specification. Thus for example, a first characteristic signature may be a vector of 
gene expression values for "gene A" measured for each of samples 108a - 108n, while a 
second characteristic signature may be generated for "gene B" as a vector of gene 
expression values for "gene B" , where a measurement of gene expression of gene B has 
been taken fi-om each of samples 108a-108n, and so forth. Fig. 4 shows an idealized, 
schematic representation of a plot 400 for measured characteristic signatures (gene 
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response expression profiles, in this case) 404, 406 and 408, corresponding to three genes 
from the samples, wherein 404a is the gene expression value for the gene represented by 
characteristic signature 404, measured at sample a, 404b is the gene expression value for 
the gene represented by characteristic signature 404, measured at sample b, etc. (see also 
paragraph [0050], page 14 of the specification). 

Independent Claim 1 further recites providing a trend profile of a second tissue measured 
property for the second type of tissue along the determined profile of locations through 
the tissue from which the samples were taken. For example, this may be a profile of a 
measure of disease activity across the sample tissue locations, or other characterization of 
the tissue of interest, e.g., see paragraph [0050], page 14 of the specification and Fig. 4, 
#402). Statistical analysis is performed with regard to the characteristic signatures 
relative to the trend profile, as those characteristic signatures that are closest to 
"synchronizing" or closely following the trend profile are considered to be related to, or 
involved in, the disease activity, or other characteristic of the tissue of interest that is 
measured by the trend profile, see paragraph [0051], page 15 of the specification, and 
Fig. 5. Rank ordering of the characteristic signatures, based on proximity to the trend 
profile is performed to identify those characteristic signatures that are considered to be 
most closely related to the trend profile, see paragraph [0055], page 17 of the 
specification and Fig. 5, #514. 

B. Independent Claim 17 recites a computer readable medium carrying one or more 
sequences of instructions for rank ordering characteristic signatures of properties (stated 
in paragraph [0006] on page 3 of the specification), measured from a plurality of samples 
taken from a heterogeneous region, wherein a first portion of the heterogeneous tissue 
region has at least first and second types of tissue and is bordered by a second portion of 
the heterogeneous tissue region, wherein the second portion is considered to be devoid of 
the second type of tissue, and wherein the plurality of samples have been taken from 
successive locations along a determined profile of locations through the heterogeneous 
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tissue region, with at least one sample being taken from the second portion, wherein 
execution of one or more sequences of instructions by one or more processors causes the 
one or more processors to perform the method steps which have already been described 
in detail with regard to independent Claim 1 above. 

C. Independent Claim 18 recites a system for rank ordering characteristic signatures of 
properties (stated in paragraph [0006] on page 3 of the specification) generated from 
tissue samples taken from a heterogeneous tissue region, wherein a first portion of the 
heterogeneous tissue region has at least first and second types of tissue and is bordered by 
a second portion of the heterogeneous tissue region, wherein the second portion is 
considered to be devoid of the second type of tissue. The system includes means for 
providing a trend profile of a second tissue measured property of the second type of 
tissue along a determined profile of locations through the heterogeneous tissue region 
from which tissues samples are taken as the sources of the characteristic signatures; 
means for performing statistical analysis on each of the plurality of characteristic 
signatures with regard to the provided trend profile; and means for rank ordering the 
plurality of characteristic signatures based on proximity to the trend profile as determined 
by the statistical analysis, each of which means may be embodiment by hardware 
described at paragraphs [0075] - [0076], page 25 of the specification, implementing 
multiple software modules for performing the method steps described in detail above 
with regard to Claim 1, see paragraph [0077], page 25 of the specification. 

VI. Grounds of Rejection to be Reviewed on Appeal 

A. Whether the invention as defined by Claims 1-13 is patentable under 35 USC § 
102(b) over Singh et al., "Gene Expression Correlates of Clinical Prostrate Cancer 
Behavior", Cancer Cell, Vol. 1, March 2002 (hereafter, Singh et al.) because Singh et al. 
does not disclose or inherently possess each and every feature recited in Claims 1-13 of 
the instant application. 
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B. Whether the invention as defined by Claims 1-10, 12-13 and 17-20 is patentable 
under 35 USC § 102(b) over Crosby et ah, U.S. Patent Publication No. 2003/0190689 
(hereafter, Crosby et al.) because Crosby et al. does not disclose or inherently possess 
each and every feature recited in Claims 1-10, 12-13 and 17-20 of the instant application. 

VII. Argument 

Al. Rejection of Claims under 35 USC § 102(b) - Singh et al. 

In the Final Office Action dated January 8, 2007 ("OA 01/08/07"), the Examiner rejected 
Claims 1-13 under 35 USC § 102(b) as being anticipated by Singh et al. 

Bl. Standard of Rejection under 35 USC § 102 

Under 35 U.S.C. § 102, every limitation of a claim must identically appear in a single 
prior art reference for it to anticipate the claim. In re Bond, 910 F.2d 831,832, 15 
U.S.P.Q.2D (BNA) 1566, 1567 (Fed, Cir, 1990). 

Cl(a). Application of Standard of Rejection under 35 USC § 102 to Claim 1 

Appellant/Applicant submits that the Examiner has not met the burden of presenting a 
proper rejection under 35 U.S.C. § 102(b) because not every limitation of Claim 1 
identically appears in the Singh et al. reference. 

The Examiner asserted that Singh et al. discloses taking a plurality of samples from a 
heterogeneous tissue region and the formation of a plurality of characteristic signatures 
therefrom. The Examiner referred to page 209, column 1, second full paragraph of Singh 
et al. as support for these assertions, see page 3, second paragraph under the "Response to 
arguments" section of OA 01/08/07. 
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The Examiner further asserted that Singh et al. teaches the formation of a plurality of 
characteristic signatures. As support for this assertion, the Examiner referred to Figure 3 
of Singh et al. indicating that this represents a plurality of characteristic signatures as a 
set of 12 genes, wherein the Examiner asserted that the measured expression of each gene 
listed in Figure 3 of Singh et al. forms a characteristic signature, and collectively, the 
genes constitute a plurality of characteristic signatures. 

In the Advisory Action dated March 21, 2007 (AA 03/21/07), in response to Applicant's 
Request for Reconsideration filed February 26, 2007, the Examiner asserted that Singh et 
al. teaches taking a plurality of samples from tissue obtained from radical prostatectomy. 
The Examiner referred to page 208, column 1 of Singh et al., stating that this was 
interpreted to mean that normal tissue was bounded on opposite sides by tumorous tissue 
in the removed prostrate specimen and that this was interpreted to read on the claim 
recitation of a "plurality of samples taken from a heterogeneous region". The Examiner 
fiirther indicated that Singh et al. teaches taking two samples, one of normal tissue and 
one of diseased tissue, from each of the 52 prostates obtained, and that this teaches the 
limitations of the claims reciting a plurality. 

Appellant/Applicant respectfiiUy traverses the Examiner's interpretations. First, 
Appellant/Applicant respectfiiUy submits that page 208, column 1 of Singh et al. does not 
disclose use of 52 prostate specimens from a total of 65 prostate specimens, contrary to 
the Examiner's assertion. Rather, page 208, column 1 discloses 235 samples of prostate 
tumors and adjacent prostate tissue not containing tumor (normal), of which, 65 had 
cancer present on opposing sides of the OCT embedded specimens. Each of these 
samples was disclosed to have been reviewed. 

Page 204, column 1 of Singh et al. discloses that high-quality expression profiles were 
successfiiUy derived from 52 of the prostate tumors and 50 of the nontumor (normal) 
prostate samples. Thus, the "expression profiles" derived for these samples were derived 
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independently of one another, i.e., 52 expression profiles were derived for the 52 prostate 
tumor samples and 50 expression profiles were derived for the 50 normal samples. Each 
expression profile derived was derived for a single sample only. Thus, it is respectfully 
submitted that Singh et al. discloses the derivation of "gene expression signatures" or 
"gene expression profiles", and not "characteristic signatures" or "gene expression 
response signatures" or "gene expression response profiles" as recited in the present 
claims. Page 8, paragraphs [003 1]-[0032] clearly distinguishes between these different 
types of signatures. Paragraph [0031] describes the type of expression profiles that are 
disclosed by Singh et al., which are the traditional "gene expression profiles", where a 
"gene expression profile" refers to gene expression values of a number of genes, typically 
fi-om the same sample. 



In contrast a "characteristic signature", such as a gene expression response profile refers 
to a profile generated bv expression values of the same gene over a number of samples . 
Claim 1 recites that each characteristic signature is formed of values for a particular 
property that has been measured fi'om a plurality of samples taken from a heterogeneous 
tissue region. 

Singh et al. clearly does not form a characteristic signature including a value from a 
normal tissue sample and a value from a tumor sample, contrary to the Examiner's 
assertion. Page 204, column 1 of Singh et al. discloses that "Genes were ranked 
according to their differential expression across the two classes (tumor versus normal), 
indicating that the expression profiles were each generated for a single sample (tumor or 
normal), not across a plurality of samples taken from the same tissue region. 

Further, since the number of tumor samples (52) and nontumor samples (50) is unequal, 
this fiirther supports that Singh et al. did not form characteristic signatures from values of 
property measured along different locations of a heterogeneous tissue region, since it 
would have been impossible to do so, in this case, for at least two of the tumor samples. 
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Instead, Singh et al. ranked gene expression values from each normal sample, relative to 
each other, and ranked gene expression values form each tumor sample, relative to each 
other. 

Still further, neither locations of the tumor tissue samples nor the normal tissue samples 
of the Singh et al. study are identified or tracked so as to be able to correlate locations of 
gene expression values with locations of values along a trend profile of a second tissue 
measured property. Singh et al. does not provide a trend profile of a second tissue 
measure property for the second type of tissue along a determined profile of locations 
through a heterogeneous tissue region. That is, Singh et al. does not provide a trend 
profile of an activity level of the tumor tissue through both normal tissue and tumor tissue 
for any one particular sample, nor does Singh et al. provide a determined profile of any 
type of measure of the cancer tissue through the tissue samples measured. Rather, Singh 
et al. simply compares gene expression values for tumor tissues, relative to one another, 
to attempt to identify those genes that are significantly expressed within the tumor 
samples. Likewise, Singh et al. compares gene expression values for normal samples, 
relative to each other (normal samples) to attempt to find genes that are significantly 
expressed in normal tissues. This study is absolutely location independent, and is 
dependent only upon the types of tissues that are being examined. There is no disclosure 
or suggestion of comparing a profile including values of even one tumor sample and one 
normal sample from the same heterogeneous tissue sample, to a trend profile developed 
from values of another property at the same locations from which the samples were 
taken. 

The Examiner fiirther asserted that Singh et al. at page 206, column 1, paragraph 2, 
discloses that a readily detectable and statistically significant signature of GS (Gleason 
score) exists that shows a correlation between GS and the measured gene expression 
profiles. However, in the line following the referred to disclosure, Singh et al. discloses 
"The expression pattem of these genes separated txmiors into distinct groups during 
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hierarchical clustering in both our initial and in a validation data set and grouped some of 
the intermediate grade tumors with high-grade tumors." The fact that Singh et al. refers 
to an "expression pattem" as composed of a plurality of "genes" further supports that 
Singh et al. generates "gene expression profiles", where each "gene expression profile" 
includes gene expression values of a number of different genes from the same sample, 
and that Singh et al. does not form a characteristic signature having values for the same 
gene across a number of locations measured in a single heterogeneous tissue region. 

The Examiner further asserted in AA 03/21/07, that "Singh et al. also teach gene 
expression signature profile composed of a plurality of gene expression signatures of GS 
(p. 204, col. 2 para. 2, lines 1-5)." Upon referring to page 204, column 2, paragraph 2, 
lines 1-5 of Singh et al., Appellant/Applicant notes that this portion of the disclosure 
describes analyzing expression pattems within the 52 tumor samples. Accordingly, there 
were no gene expression values taken from the 50 normal samples for this portion of the 
study. Appellant/Applicant further respectfully submits that the "gene expression 
signature profiles" referred to by the Examiner and disclosed by Singh et al. as 
expression pattems are the traditional "gene expression profiles" described in the present 
application specification at page 8, paragraph [0031], as Singh et al. does not disclose or 
suggest forming a profile of gene expression values for one gene across a plurality of the 
samples. Still further, even if this were the case, and Singh et al. did form a profile of 
expression values for the same gene across a plurality of the 52 tumor samples, which 
Appellant/Applicant does not agree that Singh et al. discloses this, this would still not 
read on a characteristic signature formed of values for a particular property having been 
measure from a plurality of samples taken from a heterogeneous region, as the 52 tumor 
samples are not taken from a single heterogeneous region, but from 52 different patients. 
Still further, such a hypothetical characteristic signature (not disclosed by Singh et al.) 
would also not include a value from a normal sample, as Singh et al. clearly discloses that 
only the 52 tumor samples are being analyzed in this portion of the disclosure. 
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The Examiner further asserted in AA 03/21/07, that "Singh et al. provide a trend profile 
for the second type of tissue and perform a statistical analysis on each of the 
characteristic signatures." The Examiner asserted that Singh et al. teach a profile of a 
second tissue determined along the profile of locations in providing a "normal" gene 
expression profile, as disclosed at page 204, column 1. Appellant/Applicant does not 
understand this argument. Singh et al. discloses that genes were ranked according to their 
differential expression across the two classes (tumor versus normal). There is no 
disclosure of a trend profile along a determined profile of locations through a 
heterogeneous tissue region. Rather, Singh et al. merely ranks the expression values of 
genes within two different classes of tissues samples, i.e. a rank ordering of expression 
values within the normal samples group is performed and a rank ordering of expression 
values within the tumor samples group is performed. This has nothing whatsoever to do 
with correlating signatures with a trend profile based on locations of a heterogeneous 
tissue sample from which the values were taken. 

The Examiner asserted in AA 03/21/07 that "A gene is interpreted to be a property and 
gene expression is interpreted to be characteristic signature." From this, the Examiner 
concluded that "Singh et al. teaches obtaining gene expression data (characteristic 
signatures) from normal and tumorous samples for 12,000 genes (properties). 
Appellant/Applicant respectfully submits that the Examiner's interpretation scheme 
referred to above falls apart logically in the portion of claim 1 that recites that a 
characteristic signature is formed of "values for a particular property having been 
measure from a plurality of samples taken fi-om a heterogeneous tissue region". The 
12,600 genes referred to by Singh et al. at page 204, column 1 are all genes from the 
same tissue sample as measured on a single microarray. Accordingly, each gene 
(property, according to the Examiner's scheme above) is associated with only one 
expression value, for only one tissue sample. Singh et al. does not disclose combining 
gene expression values across different microarrays, for a single gene, to form a 
characteristic signature. The Examiner's interpretation is faulty, because a "gene 
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expression profile" for 12,600 genes, thus having 12,600 gene expression values for 
12,600 different genes, does not read on a characteristic signature having a plurality of 
values for a single gene having been measure from a plurality of samples taken from a 
single heterogeneous region, wherein the heterogeneous region contains diseased tissue 
in a first portion, and no diseased tissue in a second portion. Singh et al. does not form 
characteristic signatures as claimed. Also, Singh et al. analyzes the normal tissue 
samples separately from the tumor tissue samples. 

The Examiner asserted in AA 03/21/07 that the Gleason score (GS) meets the recitation 
in Claim 1 of a 'trend profile of a second tissue measured property for the second type of 
tissue along the determined profile of locations through the heterogeneous tissue region. 
The Examiner referred to Singh et al., page 204, column 2, paragraph 2, lines 2-5 as 
support for this assertion. This portion of the Singh et al. reference, as noted above, 
refers to analysis of 'the expression pattems within the 52 tumors". Accordingly, it is not 
possible to consider a trend profile of a measured property for the second type of tissue 
(tumor) along the determined profile of locations through the heterogeneous tissue 
region, by this disclosure of Singh et al., because the determined profile of locations is 
defined in Claim 1 as including at least one sample being taken form the second portion 
which is devoid of tumor. Since Singh et al. analyzes only tumor tissues in the portion of 
the disclosure referred to by the Examiner, it follows that Singh et al. does not include a 
measurement from a nontumorous location in a trend profile. 

The Examiner argued in AA 03/21/07 that Singh et al. takes samples from two successive 
locations in the same prostate specimen. Appellant/ Applicant respectfully submits that 
the 52 tumor samples and the 50 normal samples are analyzed separately, since the gene 
expression values of the 52 timior samples are rank ordered amongst themselves and the 
gene expression values of the 50 normal samples are rank ordered amongst themselves, 
Singh et al. fails to disclose forming a characteristic signature from a gene expression 
value for a gene (e.g., "gene A") from a tumor sample and a gene expression value for the 
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same gene (gene "A") from a normal sample adjacent the tumor sample, and then 
comparing such a characteristic signature with a trend profile containing values 
corresponding to the same two locations from which the gene expression values were 
read. 

Cl(b). Application of Standard of Rejection under 35 USC § 102 to Claim 6 

Appellant/Applicant submits that the Examiner has not met the burden of presenting a 
proper rejection under 35 U.S.C. § 102(b) because not every limitation of claim 6 
identically appears in the Singh et al. reference. 

Claim 6 depends from claim 1 and, it is respectfully submitted, is allowable for at least 
the reasons provided above with regard to claim 1 . Further, claim 6 recites in part 
"comparing each of the plurality of characteristic signatures with the provided trend 
profile by curve-fitting to a statistical regression fiinction, wherein said curve-fitting 
determines the degree of proximity of each of the plurality of characteristic signatures to 
the provided trend profile/' 

In the Office Action dated 08/28/2006 (OA 08/28/2006), the Examiner referred to Singh 
et al., page 208, asserting that this discloses the above-recited claim language of Claim 6. 
However, the "K-nearest neighbor" techniques described by Singh et al. are used to 
determine Euclidean distances between gene expression values of individual genes. 
There is no disclosure of comparison to a trend profile, or of providing characteristic 
signatures as claimed. 

Dl. Conclusion 

Based on the arguments set forth, Appellant/Applicant submits that under the Standard of 
Rejection under 35 USC § 102, Claims 1-13 are not properly anticipated by Singh et al. 
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A2. Rejection of Claims under 35 USC § 102(e) - Crosby et aL 

In OA 01/08/07, the Examiner rejected Claims 1-10, 12-13 and 17-20 under 35 USC § 
102(e) as being anticipated by Crosby et al. 

B2. Standard of Rejection under 35 USC § 102 

Under 35 U.S.C. § 102, every limitation of a claim must identically appear in a single 
prior art reference for it to anticipate the claim. In re Bond, 910 F.2d 831,832, 15 
U.S.P.Q.2D (BNA) 1566, 1567 (Fed. Cir. 1990). 

C2(a). Application of Standard of Rejection under 35 USC § 102 to Claim 1 

Appellant/Applicant submits that the Examiner has not met the burden of presenting a 
proper rejection under 35 U.S.C. § 102(b) because not every limitation of Claim 1 
identically appears in the Crosby et al. reference. 

The Examiner in AA 03/21/07 asserted that Crosby et al. discloses at paragraph [0025] 
obtaining a plurality of samples form patients having positive and negative disease 
outcomes, which the Examiner considered to read on the claimed plurality of samples. 
Appellant/Applicant respectfully traverses this interpretation. Claim 1 specifically recites 
that the "plurality of samples" are taken "from a heterogeneous tissue region, wherein the 
heterogeneous tissue region includes a first portion having at least first and second types 
of tissue, bordered by a second portion, said second portion considered to be devoid of 
the second type of tissue. It is respectfully submitted that a plurality of samples taken 
from a single heterogeneous tissue region is distinct firom the disclosure of Crosby et al. 
of taking samples from a plurality of patients, and thus a plurality of tissue regions. 

The Examiner further asserted in AA 03/21/07 that Crosby et al. discloses measuring 
from a plurality of samples taken from a heterogeneous tissue region at paragraph [0080], 
where multiple sequential tissue slices are analyzed. Although Crosby et al. does 
disclose analyzing multiple sequential tissue slices, Crosby et al. also discloses at 
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paragraph [0080] that the tissue slices or cells are analyzed individually, either 
sequentially or in parallel. There is no disclosure of forming characteristic signatures as 
claimed. For example, there is no disclosure of forming a characteristic signature of 
protein activity values for a particular protein across multiple sequential slices or cells of 
a tissue region. Rather, particular cells having activated proteins are identified and then 
compared to a protein activity value of a normal cell. In contrast, the present invention 
forms a characteristic signature of values taken from both normal and diseased tissues at 
determined locations along a heterogeneous tissue region, and compares the characteristic 
signature to a trend profile of values from the same locations that the values for the 
characteristic signature were taken. This is neither disclosed nor suggested by Crosby et 
al. 

The Examiner also referred to paragraph [0092] of Crosby et al. in AA 03/21/07, 
asserting that this discloses determining the correlation of protein activity (characteristic 
signature) and a disease outcome (trend profile). Again, the Examiner has incorrectly 
interpreted the recited elements "characteristic signature" and "trend profile". Paragraph 
[0092] indicates that cluster analysis may be performed on protein activity and a disease 
outcome, such as survival, or death. It is respectfully submitted that it would be 
nonsensical to consider forming a trend profile of values along determined locations of a 
heterogeneous tissue region if the values for the trend profile were either death or 
survival, since either the tissue survives or it dies. Accordingly, only a flat trend profile 
could be prepared according to the Examiner's interpretation, either having all values of 
"death" or all values of "survival". This would result in a flat trend profile, which would 
not be useful for analysis. 

Additionally, as already noted, Crosby et al. does not disclose forming a characteristic 
signature of values for a single particular property, across samples at determined 
locations of a heterogeneous tissue. 
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C2(b). Application of Standard of Rejection under 35 USC § 102 to Claim 17 

Appellant/Applicant submits that the Examiner has not met the burden of presenting a 
proper rejection under 35 U.S.C. § 102(b) because not every limitation of claim 17 
identically appears in the Crosby et al. reference. 

It is respectfully submitted that Claim 17 is allowable for at least the same reasons 
provided above with regard to Claim 1, as Claim 17 contains all of the claim limitations 
distinguished above with regard to Claim 1. 

Further, the Examiner in AA 03/21/07 asserted that Crosby et al. discloses at paragraph 
[0013] automatic analysis using high-throughput automation. It is respectfully submitted 
that paragraph [0013] of Crosby et al. is a part of the background section that identifies 
what would be desirable, and does not disclose anything concrete, but rather indicates 
what would be desirable to achieve. Further, paragraph [0013] does not disclose a 
computer readable medium. 

C2(c). Application of Standard of Rejection under 35 USC § 102 to Claim 18 

Appellant/ Applicant submits that the Examiner has not met the burden of presenting a 
proper rejection under 35 U.S.C. § 102(b) because not every limitation of claim 18 
identically appears in the Crosby et al. reference. 

It is respectfully submitted that Claim 18 is allowable for at least the same reasons 
provided above with regard to Claim 1, as Claim 18 contains claim limitations 
distinguished above with regard to Claim 1 . 

Further, the Examiner in AA 03/21/07 asserted that Crosby et al. discloses at paragraph 
[0013] automatic analysis using high-throughput automation. It is respectfully submitted 
that paragraph [0013] of Crosby et al. is a part of the background section that identifies 
what would be desirable, and does not disclose anything concrete, but rather indicates 
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what would be desirable to achieve. Further, paragraph [0013] does not disclose a 
system. 

Still further, since Crosby et al. does not provide a trend profile as claimed, for reasons 
discussed above with regard to Claim 1, it follows that Crosby et al. does not provide 
means for providing a trend profile as part of a system. 

Since Crosby et al. does not perform statistical analysis on a plurality of characteristic 
signatures with regard to a trend profile as claimed, it follows that Crosby et al. does not 
provide means for performing statistical analysis on each of the characteristic signatures 
with regard to the provided trend profile. 

D2. Conclusion 

Based on the arguments set forth, Appellant/Applicant submits that imder the Standard of 
Rejection under 35 USC § 102, Claims 1-10, 12-13 and 17-20 are not properly 
anticipated by Crosby et al. 

E. Conclusion 

The Examiner has failed to show anticipation of Claims 1-13 under 35 USC § 102(b) 
over Singh et al. 

The Examiner has failed to show anticipation of Claims 1-10, 12-13 and 17-20 
under 35 USC § 102(e) over Crosby et al. 
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Accordingly, Appellant/Applicant requests that the Board of Patent Appeals and 
Interferences reverse the rejection of Claims 1-13 and 17-20 on the grounds set forth 



John Brady 

Agilent Technologies, Inc. 
Legal Department, DL429 
Intellectual Property Administration 
P.O. Box 7599 
Loveland, CO 80537-0599 
Telephone: (408) 553-3584 
Facsimile: (408) 553-2365 



herein. 



Respectfully submitted. 




<Alan W. Cannon for John Brady 
Registration No. 34,977 
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VIII, Claims Appendix 

This Appendix contains the Claims involved in the Appeal: 

1. A method for rank ordering characteristic signatures of properties, said method 
comprising the steps of: 

forming a plurality of characteristic signatures, each said characteristic signature 
being formed of values for a particular property having been measured from a plurality of 
samples taken from a heterogeneous tissue region, wherein the heterogeneous tissue 
region includes a first portion having at least first and second types of tissue, bordered by 
a second portion, said second portion considered to be devoid of the second type of 
tissue, wherein the plurality of samples have been taken from successive locations along 
a determined profile of locations through the heterogeneous tissue region, with at least 
one sample being taken from the second portion, and wherein each said characteristic 
signatures are formed from different ones of said particular properties, respectively; 

providing a trend profile of a second tissue measured property for the second type 
of tissue along the determined profile of locations through the heterogeneous tissue 
region; 

performing statistical analysis on each of the plurality of characteristic signatures 
with regard to the provided trend profile; and 

rank ordering the plurality of characteristic signatures based on proximity to the 
trend profile as determined by the statistical analysis. 

2. The method of claim 1, fiuther comprising the step of: 

measuring the specific property for each of the plurality of samples for at least 
one of characteristic signatures. 

3. The method of claim 1, fiirther comprising the steps of: 
providing the heterogeneous tissue region: and 
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taking the plurality of samples from the heterogeneous tissue region. 

4. The method of claun 3, further coniprising the step of: 

measuring each specific property for each of the plurality of samples for each 
respective characteristic signature. 

5. The method of claim 1, wherein the step of forming a plurality of characteristic 
signatures includes normalizing each of the plurality of characteristic signatures with 
respect to a baseline reference signature, said baseline reference signature corresponding 
to a measured property of a sample taken from the second portion. 

6. The method of claim 1, wherein the step of performing statistical analysis 
includes: 

comparing each of the plurality of characteristic signatures with the provided 
trend profile by curve-fitting to a statistical regression function, wherein said curve-fitting 
determines the degree of proximity of each of the plurality of characteristic signatures to 
the provided trend profile. 

7. The method of claim 1, wherein the step of performing statistical analysis 
includes: 

calculating a p-value with regard to each of the plurality of characteristic 
signatures, to test the null hypothesis between each of the plurality of characteristic 
signatures and the provided trend profile. 

8. The method of claim 1, wherein the step of performing statistical analysis is 
done in one-, two- or three-dimensional space. 

9. The method of claim 1, wherein the first type of tissue is healthy tissue. 
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10. The method of claim 1, wherein the second type of tissue is diseased tissue. 

1 1 . The method of claim 1, wherein one of the specific properties is an 
expression level of a gene. 

12. The method of claim 2, wherein the step of measuring the specific property 
comprises measuring different specific properties across the samples to form a plurality 
of characteristic signatures by processing each of the plurality of samples using a 
microarray technique. 

13. The method of claim 13, wherein said processing comprises processing each 
of the plurality of samples on a single two-color microarray, two single-color microarrays 
or both. 

17. A computer readable medium carrying one or more sequences of instructions 
for rank ordering characteristic signatures of properties measured fi-om a plurality of 
samples taken fi-om a heterogeneous region, wherein a first portion of the heterogeneous 
tissue region has at least first and second types of tissue and is bordered by a second 
portion of the heterogeneous tissue region, wherein the second portion is considered to be 
devoid of the second type of tissue, and wherein the plurality of samples have been taken 
firom successive locations along a determined profile of locations through the 
heterogeneous tissue region, with at least one sample being taken firom the second 
portion, wherein execution of one or more sequences of instructions by one or more 
processors causes the one or more processors to perform the steps of: 

forming a plurality of characteristic signatures, each said characteristic signature 
being formed of values for a particular property having been measured from the plurality 
of samples; 
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providing a trend profile of a second tissue measured property of the second type 
of tissue along the determined profile of locations through the heterogeneous tissue 
region; 

performing statistical analysis on each of the plurality of characteristic signatures 
with regard to the provided trend profile; and 

rank ordering the plurality of characteristic signatures based on proxunity to the 
trend profile as determined by the statistical analysis. 

18. A system for rank ordering characteristic signatures of properties generated 
from tissue samples taken fi-om a heterogeneous tissue region, wherein a first portion of 
the heterogeneous tissue region has at least first and second types of tissue and is 
bordered by a second portion of the heterogeneous tissue region, wherein the second 
portion is considered to be devoid of the second type of tissue, the system comprising: 

means for providing a trend profile of a second tissue measured property of the 
second type of tissue along a determined profile of locations through the heterogeneous 
tissue region fi-om which tissues samples are taken as the sources of the characteristic 
signatures; 

means for performing statistical analysis on each of the plurality of characteristic 
signatures with regard to the provided trend profile; and 

means for rank ordering the plurality of characteristic signatures based on 
proximity to the trend profile as determined by the statistical analysis. 

19. The system of claim 18, fiirther comprising 

means for forming the plurality of characteristic signatures based on 
measurements of values for a particular property measured from the plurality of samples, 
each of said characteristic signatures formed fi-om values for a particular property 
different from the particular properties measured to form others of the characteristic 
signatures. 
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20. The system of claim 1 8, fiirther comprising: 

means for measuring at least one said specific property for each of the plurality of 
samples. 
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IX. Evidence Appendix 

This Appendix contains copies of: 

A. Singh et al., "Gene expression correlates of clinical prostate cancer behavior", 
Cancer Cell: March 2002, Vol. 1,203-209. 

This reference was entered by Examiner Karlheinz R. Skowronek in the Office Action 
dated 08/28/2006, 

B. Crosby et al., U.S. Patent Publication No. 2003/0190689 Al . 

This reference was entered by Examiner Karlheinz R. Skowronek in the Office Action 
dated 08/28/2006. 

X. Related Proceedings Appendix 

This Appendix has no copies of decisions in related proceedings because there are no related 
proceedings to this Appeal. 
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Summary 

Prostate tumore are among the most hetetoganeous of cancers, both histologically and clinically. Microarray expression 
aiRilysis was used to determine whether global blolotflcal differences underlie common pathologteal features of prostate 
cancer and to identify genee that might anticipate the clinical behavior of this disease. While no repression correlates of 
age, serum prostate specific antigen (PS/4, and measures of local Invasion were found, a set of genes was identified that 
strongly correlated with the state of tumor differentiation as measured by Gteason score. Moreover, a model using gene 
expression data alone accurately predicted patient outcome following prostatectomy. These results support the notion 
that the clinical behavior of prostate cancer Is linked to underfying gene expression differences that are detectable at the 
time of diagnosis. 



Introduction 

Prostate cancer is the most common nondermatoiogical cancer 
in the United States with an estimated 196,100 new cases and 
31 ,500 deaths In 2001 (Greenlee et ai., 2000). The adoption of 
screening based upon the measutement of the serum prostate spe- 
cific ant^sn (pSA) has led to the earlier detection of prostate 
cancer where most tumors now appear confined to the prostate 
gland at presentation (Han et al., 2001 ). Early diagnosis provides 
an opportunity for curative surgery. However, up to 30% of men 
undergoing radical prostatectomy will relapse, often as a result 
of mlcrometastatic disease present at the time of surgery (Rob- 
erts et ai., 2001a, 2001b). Hie chaUenge is to identny those 



patients at risli for relapse and to better understand the molecu- 
lar abnomiallties that define iunws at risic for relapse. 

Several clinical features of prostate cancer including tumor 
stage(Jewett, 1975). degree of tumor cell differentiation or Glea- 
son score (GS) (Gleason; 1966), and the serum PSA (Stamey 
et ah, 1987) are used in routine clinical practice to separate 
men into groups at low, intemnediate, and high rlsIc for tumor 
recunmce following local therapy. However, the m^rity of 
patients who now undergo prostatectomy have low to interme- 
diate rlsic clinical features, and determining the prognosis for 
these patients remains difficutt. 

The utility of existing prognostic factors might be limited 
because they largely measure tumor differentiation and buiii but 



SIGNIFICANCE 

improved potient gratification con allow the roHonal oppilcoHon of cuneni treotmenls and the selected testing of novel therapeutics 
In poHenl popufoHons mod Ucely to benem. CRnlcoi features Includtog Gleoson score, tumor ilage, ond lemm prostate specHk: 
antigen (PSA) m used to asset* relopte-rlsk In men with prostate cancer. Such parometers are fess useful In guiding therapy for 
men having Intemiedlote rtdc disease, 3M of whom recur folbwing locol ttieropy. Our doto suggest that expression -based models 
may help to Wentffy poflenis of greotediiskfor recurrence ond thus fadlHale IheroHonol applcatlon of current theroples. Furthemiore, 
the ossoclollon of spedSc genes* expresdoa such as ptalelet-deflved growth Idetor p (PDGFRp), wHh outcome roOses the posdbllBy 
thai expresdon anoiyds may prove useful In selecting patients for em^lng mechonlim-based therapeutics. 



CANCER CEIL : JVIARCH 2002 • VOL. 1 • COPYRIGHT O 2002 CELL PRESS 



203 



R E P O R 1 



do not otherwise sample the underlying biological properties 
that likely drive tumor behavk)r. Attempts to explore genetic 
oonretates of tumor behavkDr have found aKerations tn a numtser 
of candkjflle genes associated with prostate cancer pn>ores^ 
slon, including loss of p53, ampliflcatJon of myc» loss of p27, 
and loss of PTEN (reviewed in Sellers and Sawyer3» 2001). 
However, no single gene has been shown to have sufficient 
prognostic utility to wanrant clinical implementation* 

Recently, genomic methodologies have t>een used to dis- 
cover consistent gene expression pattens associated with a 
given histologicalorcnnlccdphenotype (Golubetai., 1999;Parou 
et ed., 2000; van't Veer et al., 2002). Here, gene expression 
patterns from 52 tumor and 60 nonmal prostate specimens were 
studied in order to ask whether such patterns could be used 
to predk:t common cfmical and pathok>gical phenotypes rale- 
vant to the treatment of men diagnosed wHh this disease. In 
addition to expression patterns that correlated with QS and 
with the distinction of tumor from normal, an expression-based 
model was built that accurately predicted patient outcome. 
Ihese data suggest that it may be possible to predict the dinlcal 
behavkir of prostate cancer based upon gene expression analy- 
sis of primary tumors. Such prediction strategies, if generaliz- 
able, would dlow for the rational application of additional post- 
surgical therapeutics to hlgh*rlsk individuals. 

Reautts 

Tumor versus normal oiassifleation 

To Investigate whether robust gene expression differences 
could be found that distinguished common clinical and patho- 
logical features of prostate cancer. 235 radical prostatectomy 
specimens were analyzed from patients undergoing surgery 
between 1995 and 1997. Of these samples. 65 had tumor on 
opposing sides of the tissue specimen. High-quality expression 
profiles were successfully derived from 52 of these prostate 
tumors and 50 nontumor prostate samples (referred to as normal 
hereafter) using oligonucieotide mlcroarays containing probes 
for approximately 12,600 genes and ESTs (raw data available 
at httpyywww-genome.wi.mitedu/MPFVprostate). The clinical 
and pathological features of the 52 patients and tumors included 
in this study were indistingutehabte from those of ail patients 
treated with radical prostatectomy during the collection period 
(Table 1). 

Genes were ranked according to their differential expression 
across the two classes (tumor versus normai) using a variation of 
a signal-to-ndse metric (S2N) (Golub et al., 1 999). The statistical 
significance of these gene expression connelatlons was deters 
mined by comparing the observed conBlations to the results 
derived from 1000 permutations of the class labels (tumor or 
normal). This analysis Indicated that 317 genes had higher ex- 
pression in the tumor samples (p s 0.001) whereas 139 genes 
were more highly expressed in normal prostate samples (p ^ 
0.001) (Supplemental Figure SI fsee Suwilementai data. betowD. 

Gene expression differences between tumor and normal 
prostate samples have been previously reported (ChetcutI et 
al^, 2001 ; Dhanasekaran et al., 2001 ; ijuo et al. 2001 ; Welsh et 
al.. 2001); however, the feasibility of using such differences to 
predict the Identity of prostate samites has not been tested. 
To this end, we bulft predictors using a /r-nearest neighbor 
(fc-NN) supervised machine learning algorithm. IWIodels that uti- 
lized 4 or more genes classified tomples with greater than 90% 



accuracy in leave^e-out cross^idation testing (p < 0,001 
as measured by pemiutation testing) (Suppl. Rgure S2A and 
S2B). The 4-gene and 1 6-gene models were tested on an inde- 
pendent data set of 8 nonral and 27 tumor prostate samples 
provided by G. Hampton (Welsh et al., 2001). Despite a nearfy 
10-fold difference In overatt mlcroaray Intensity beftween these 
datasets (see Supplemeiila Experimental procedures (belowD, 
the classifier perfbnmed witi relatively high accuracy (4^ene 
model 77% ; 1 6-gene mode 86%; p < 0.05, Hsher's exact test) 
(Suppl. Figure S3). Thus, expi>95sk)n differences can be used 
to predict the kilentity of un <nown prostate samples and these 
gene expresston differenoes are conserved across Independent 
data sets. 

Prediction of pathologic^ features of prostata cancer 

In order to ask whether g'ene expression patterns exist that 
describe and or predict the differences in clinical behavkM* ap- 
parent among prostate tumors, the expression patterns wWiln 
the 52 tumors were analyased. Conrelatlons between gene ex- 
pression and known cUnlc^i and pathological parameters were 
determined for dichotomojs variables (e.g,T the presence or 
absence of capsular penetiation, perineural invasion, or positive 
surgical margins), as well ^ for factors treated as continuous 
variables (e.g., patient ag6, semm PSA, and GS). Statistical 
significance was determirKld by comparing the obsen^ed corre- 
lations to those congelations measured in randomly permuted 
datasets. With the exception of GS (see t>elow), no statistically 
significant gene expression conrelates of these clinical and 
pathological features wer^ observed (see Suppl. Figure S4). 
Specifically, no expression' signature dlscrimn^ated t>etween lo- 
cally invasive and noninvaslive phenotypes (e.g., capsular pene- 
tration, positive surgical margins, and perineural invaston). Thus, 
while these features are o'ten associated with different clinical 
outcomes, they are not reflected by global gene expression 
differences. 

A gene expression signature of GS, however, was detect- 
able. Fifteen genes had etxpression positively correlated with 
GS (Type I) and 14 genes lad expression negatively correlated 
with GS (Type II) beyond what would be expected by chance 
alone (p s 0.001) (Figure' 1 and Suppl. Rgure 34), As these 
genes were the most posHively and negatively correlated with 
GS, when used In hieranphlcal clustering, the 29 Type i and 
Type II were, as expected, separated Into two groups (Figure 
1). The con-elation of thes^ genes with GS and their coordinate 
expression in tumors, nonetheless, may have occurred by ran- 
dom chance alone in the initial dataset However, when the 
same 29 genes wero used to drive hierarchical clustering of the 
indepmdent data set, Typje I and Type 11 genes remaned highly 
cosegrsgated suggesting that this coexpression is reprodudble 
(p < 0.0001) (Suppl. Figu e S5). 

Strikingly. In both data sets, while most high-grade tumors 
expressed the Type I geies. a subset of intermediate grade 
tumors also expressed many of the Type I genes [Figure 1 and 
SuppL Figure S5). This inc icates that some tumois of intennedi- 
ate histological grade shjure the gene expression signature of 
higher grade tumors. 1ms, the coexpression of these genes 
may identify tumors that are of intermediate histological grade, 
yet share the molectdar fihenotype of Ngh-grade tumors. 

Prediction of clinical outcome 

In this data set, 21 patients were evaluabte with respect to 
recunrence following sunifery vi^ 8 patients having relapsed 



204 



CANCER CSLl : MARCH 2002 



REPORT 



toble 1. ClMcol and poihologioal feotures 



Vartoble 




Study group 


AB 


P 


Recurrent 


Nonrecurrent 


P 


«Potients 




S2 


393 




o 
o 


13 


0.74 


Age 


Median 


5B.5 


61 






60J0 


Range 


47-72 


jn_TD 






47-72 


0.64 


PSA 


Mecfron 


6.3 


6./ 


n tA 


6A 


6.3 


Range 


t.0-27^ 


0,7-46.0 






3.6-18.0 


045 


Gleason 


2-6 


1? |37%) 


190 (51%) 


0.10 


2 (25%) 




Score 


7 


29 (56%) 


146 i3y%i 






6 146%) 




(Cinicol) 


8^10 




34|9%| 






1 18%) 




Unknown 




23 








oxti 


Gleoson 


2-6 


24 (46%) 




9.46 




7 154%! 


Score 


7 


22 (42%) 






3 (38%) 


6 (46%) 




(Sample) 


8-10 


6 (12%f 






4 l^VToJ 


V 


IjOO 


Cin'CQl 


Tl-T2a 


38 (88%) 


285 (79%) 






10 (91%) 


Stage 


T2b 


5 112%) 


24(7%) 




U 


9 r9%k 




2tT2c 


0 


50 (14%) 




A 

\i 


Q 






Unknown 


y 








2 




Pothologic 


T2a 


7 (13%) 


49 (15%) 


A 1C 

0.1 3 




2(15%) 


0.90 


Stage 


T2b 


25 (48%) 


too tcocni 

189 |5B%| 




4 1 W7bJ 






T3a 


1Z 191 Kl 


9n (mm J 




2 (2S%> 


4 (31%) 






T3b 


4 18%) 


12 (4%) 




1 11^1 


2(15%) 






T4a 


0 


2 (1%) 




A 
V 


V 






Unknown 




53 








0.15 


Glond 


Median 


51.75 


53»0 


o>By 






Vol 


Range 


35-191 


18-191 




35.S-191 


35-169 


1.00 


Exr. Cap* 


No 


32 (62%) 


239 (73%) 


0.10 


5(63%) 


7(54%) 


rei 


20 (38%) 


88 (27%) 




3(37%) 


6(46%) 






Unknown 




66 






12 (92%) 


1.00 


SVInv. 


No 


49 (94%J 


315 (96%) 


0<44 


7t88%} 




Yes 


3(6%| 


12 [4%) 




1 (t2%) 


1(8%) 






Unknown 




66 






7(54%) 


1.00 




No 


39 (75%) 


283 (76%) 


0.86 


5(62%) 


Yes 

Unknown 


13(25%) 


89 (24%) 
21 




3 (3B%) 


6 (46%) 





PSA. «enjm prostote specific ontigen; Vol.. Volume; SV Inv.. setninal vesfcle invoslon: Ext Cap. Brfenslon through copsute; Poj. Margin. posHive surgical 
resection mor^n. Gleason Score (crinicolj Indicates me Gleoson Score recorded from me rodicol prostatectomy specimen, Gleoson Score (Somple) 
indicates the Gteason Score of the frozen sections from the tumor specimens used in RNA preporgtion. -p volue resutting from the comparison of the 
Gleason Score (Somple) of the 52 tumors to the G1eo$on Score jClnlcol) tiom the entire poputotfon. 



(defined as two suocesstve PSA values > ng/ml) and 13 
patients having remained relapse free for at least 4 years. While 
these two groups did not differ with respect to itie Clinical OS, 
semm PSA, or tumor stagsi the GS of the sections adjacent to 
tissue used for RNA extraction was ^8 in a greater proportion 
of recun-errt patients (4/8 versus Q/13) (Table 1). 

While no single gene was statistically associated wHfi recur- 
rence (at p ^ 0.05) {data not shown), when a /f-NN classification 
approach was applied, a S-gene model with 2 nearest neighbors 
(k » 2) reached 90% accuracy in predicting recurrence during 
leave-one-out cross validation. When Kaplan-Meier survival 
analysis was peitomied based upon the predicted outcome, the 
results compared favorably with Imown prognostic indicators In 
this data set (Rgure 2B). However, the standard Kaptan-Meler 
log-rank statists, while demonstrating a difference in the sur- 
vival curves^ does not account for the multiple hypothesis testing 
that occurred during model optimization. To furttier assess the 
statistical significance of ^is prediction model, we perfbmied 
1 000 permutations of the class labels (recurrence versus nonre- 
currence), and for each pemuitatk>n attempted to find muttlgene 
expression classifiers u^g the same range of gene numbers. 
Only 37 of the 1000 permutations yiekied models whose accu* 
racy matched or exceeded 90%. Thus, the likelihood of matoh- 



ing the success of the observed 5-gene model simply by chance 
alone was estimated at p 0.037 (Figure 2A). 

While there were too few tumor samples to allow for mijdtivar- 
late analysis, as mentioned above, only the Sample GS vras 
significantly different between patients who recurred and those 
who did not recur (Figure 2B and Table 1). Nonetheless, 4 recur- 
rent tumors were of intemned'iate grade (GS ^ 7) raising the 
possibiirty that gene expression-based models might provide 
additional prognostic infonnation not currently described by 
existing clinical and pathological parameters. 

TYie genes that were used by the 5-gene outoome predictors 
during leave-one-out cross valuation are shown in Rgure 3. 
TTie top 5 genes were each used In over half of the models, 
and included chronrK>granin A, platelet-derived growth factor 
receptor p pDQFRp), H0XC6, inositol triphosphate receptor 3 
(IPTR3) and $l^transferase-1 . 

Discussion 

There is an immediate need for robust prognostic markers capa- 
ble of identifying patients at risk of relapse following local ther- 
apy; conventional and experimental therapeutics could then be 
focused on this subpopulatfon, rather than the general popula- 
tion of prostate cancer patients, 70% of whom are cured by 
surgery atone. 
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rigm 1. Gene expreston conelotes of Gleoson seora 
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Our analysis revealed glc^al gene expression differences 
that were sufficiently robust to distinguish tumor' from nomial 
in both training and validation sets. While the level of accuracy 
(B6%-92%) Is not sufficient to n^ce histological mmlnatton, 
these molecular markers may beusefUl adjuncts to morphology- 
based diagnostics. In addition, while certain genes differentially 
expressed between ncmnal and tumor prostate specimens in 
mlcroan^ experiments have been correlated with outcome in 
large data sets (Dhanasekaran et aL, 2001), In our data such 
differentially repressed genes were not highly correlated with 
outcome. 

Among prognostic factors for prostate cancer, serum PSA 
and measures of local Invasion were not associated with robust 
gene expression signatures. The tack of an associated gene 
expression sfgnature does not exclude the possibility that such 
signatures exist. It ia possible that analyzing genes beyond those 
present on the cunwit microanray, or extending the experiment 
to larger data sets might reveal such patterns. However, there 
was a readily detectable and statlsticaily significant signature 
of GS. The expression pattern of these genes separated tumors 
into (^tinct groups during hlefarcNcal clustering in both our 
Initial and In a validation data set and grouped some of the 
Intermediate-grade tumors with high-grade tumors. In so doing, 
this set of genes may help identify a subset of histologically 
Interniediate-grade tumora that may have more aggressive clini- 
cal behavtor. 

In thte data aet, G8 was associated wHh patient outcome 
CTable 1 and Rguio 2B): however, only two of the genes corre- 
lated with GS (IQFBP-3 and C0L1A2) contributed to the but- 
come prediction model (Figure 3). instead, genes whose expres- 
sion was not correlated with GS were the most frequently used 



in the outcome prediction model, suggesting that while GS is 
associated with patient outcome, GS-indepwident markers and 
determinants of prostate cancer behavior exist. 

Attempts to build a gene expression-based predictor of re- 
currence fblk)wing proef atectomy led to a model that correctly 
predicted the outcome of 19 of the 21 evaluable patients in this 
study. While the result reached statisScal significance based 
on pemiutation te;^lng (p = 0.037), the perfomnance of the 
model may be due, at least In part, to overoptimization. As such, 
tiis is a preliminary moi:lei and larger datasets wili be required 
to reach model stability, to minimize the possibility of model 
overflttrng end ultimattJy allow the independent validation of 
such a predictor. 

Despite these limltailons, the Wentity of the genes comprise 
Ing the outcome prediction model support the existence of mea- 
surable outcome detonninants for prostate cancer recurrence. 
For example, diroiiiO|.ianln A, one of the 5 genes most fm- 
quentty used in the prediction model, has previously been asso- 
ciated with poor outcome in prostate cancer (Theodorescu et 
al., 1997). The utility o^ PDGFRp expression In the recunrence 
predictor is also iriirig Jing in light of the recent observations 
that PDQFH (a and p) ue expressed In advanced prostate can- 
cer (Chott 0t ai., 1999). The successful predictkin of patient 
outcome will ultimately lead to lmpn>ved decision making re- 
garding current therapeutic options and the rational selection of 
patients at high risk for relapse for clinical trials testing adjuvant 
therapeuttes. Furthemore, the identification of genes whose 
expreasion drives outcome and whose protein products are 
tractable targets fbr siinalt molecules may contribute to the de- 
velopment and selective application of novel mechanism-based 
treatments. Ongoing "rials of Gleevec, an Inhibitor of the abi, 
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p=0,037 (5-gene model) 
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Hgufie 2. Outcome prediction models 

A: It^e success rotes of models pnedicting out- 
come, Leove-one-out cross voDdoflon wo) used 
to build outcome predidlon models (recurrent 
versus r^onrecurrent) using from 1 to 256 genes. 
The X axis indicates number of genes used in 
model bgkfing, and the y oxis incffcotes fhB fre- 
quency of success. Shown is the number of cer- 
met predictions divided by the totol number of 
predictioru (red line) in the observed dota using 
leove-one-out cross validotion. The mean suc- 
cess rate ± ttte standard deviotion {boltom 
do«hed h'ne) and maximum success rate (top 
dashed Kne) obtoined using the permuted data 
Isshown, 

B: Dseose-fiee survival of patients itiatUed 
based on ttie S-gene rvwdel GS, serum PSA, or 
combined risk. Koplon'meb- analysis wos used to 
plot the froctlon of at-rlsltpotents lemolrtirto free 
of diseose (y axis) at ttie IrKliooted time after 
prDsfotectomy (tt am). SItown is patient strotift- 
coiion bosed on the S-gene model. GS 6 ver^ 
sus ft 7). serum PSA (< lo versus PSA « 10)« ond 
o combiriaDon of GS. serum PSA. ond surgical 
stage flow and Intermediate versus high risk). 
High risk was detined os a GS > 7. PSA a 20. 
and surgical stage T3 or higher, the remaining 
samples were considered low or intermediate 
risk. P volues were calculated using a log-rank 
test (Mantel-Hwen$ief tost}. 
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RgwttS. Genes used to buld on outcome pre- 
diction model 

The genes most cammoniy used in iShB 5-gene 
model ore shown os described for Hgure 2B. The 
expression of each gene (rowsj in each recurreni 
ornonrecufrentsomple (colurm) is represented 
by the number of stondord deviations obove 
|red} or betow (biue^ the mean for that gene 
ocross oil 21 somptes. 
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kit, and PDGFR tyrosine kinases, In prostate cancer will test 
the hypothesis that PDGFRp falls Into this jatter category. 

The samples in this study were derived from patients diag- 
nosed after the widespread adoption of PSA screening. As such 
the findings in this study are expected to l>e relevant to patients 
diagnosed today. Clearlyr larger, confirmatory studies will foe 
required priorto the implementatlofl of any changes In the clln^ 
cal care of patients with prostate cartcer. Neverttieless, these 
studies provide evidence that the clintoai phenotypes and be- 
havior of prostate cancer can be antkHpated by the analysis of 
tiie gene expression profiles. 

ExpeiitTiOfital procedures 

ProvtBte tissus samples 

From 1995 to 1997 samples of prostate tumors and adjacent prostate tissue 
not containing tumor (rsten'ed to as "normaH were collected from patients 
undergoing radical prostatectomy at the Bdgham and Women's Hospital. 
From 235 lumor" samples. 65 t)ad cancer present on opposing sides of 
the OCT embedded specimens. Each of these samples was reviewed by a 
single pathologist to determine the "Sample" GS, Otfwr pathological features 
f ram the rad ical prostatectomy specimens Included in this analysis as weil as 
from an contemporary prostatectomies were abstracted from the pathology 
rsporis Indiiding the "Ciinicar GS. 

The Wilcoxon rank sum taet CWtaoxin, 1945) and Fisher's exact test 
(Cox, 197p) was used to test for differences m oonllnuous varlabteB and 
dichotomous variables, respectively, between the study sample and all pa- 
tients treated tran 1993 to 1997 and between patients who recurred and 
those vt^o did not. Testsfor differences in these groups on ordered, catsgorV 
cal variables were done using the methods described by Mehta (Mehta 
and Patel« 1964}. Kaptan-Meir sunrival plots and log-iank statistics (Mantel- 
Haenszei lest) were generated using the S-Pkis statistical software package 
(Insightful Corp). 

C^e expression measurements 

Total RNA extraction, generatior of labeled cFti^, fragmentation, hybrMiza- 
tion to U95Av2 arrays (^ffymetrix) and vrash steps were performed as pre- 
viously described (Bhattachaijee el al^ 2001 ; Goiub et al.. 1 999). Raw expres- 
sion values were normalized to the median array Intensity and thresholds 
were set at 10 and 16.000 units. Genes whose exprsssion varied less than 
5-fotd tMtween any two sampiss in any given experimsnt wer^ removed. 

Gene rankifig, dass prsdiclion by A-neareat neighbors, 
and permutation testing for dichotomous vartablea 

Gene expression differences associated with a particular dichotomous class 
distinction were measured and ranked using a variation of the S2N statistic 
as previously described (Golub et aU 1999). Measured S2N values virere 
compared to cak:ulated S2N values obtained in 1D00 data sets where a given 
ctass tabel was randomly pemiuted (Good, 2000). For these oomparisons. p 
values represerrt the frequency at which the S2N statlslic from randomly 
permuted data exceeded the measured S2N statistn. 

K-nearest neighbor (ft-NN) class prediction models were built as pre* 
viously described (Pomeroy et at,, 2002). Briefly, for each class distinctnn 
tested, after exclusion of one sample, genes were ranked using the S2N 
metric derived from the remaining samples. The Euclidean distances (ED) 
between the withheld sample and the remaining samples were cak^jJated 
using a given number of genes (as in the figures). The identity of the left- 
out sample was predk:ted based upon the class membefshtp of the ^-K^losest 
samples weighted by the reciprocal of the EDs. Rvalues were then assigned 
based upon the frequency with which models generated and tested on 1 000 
randomly permuted data sets performed better than nnodeis generated using 
the obsBfved data. 

Correlation of gene expression with continuous variabias 
The Pearson coefficient was used to meaaura oorrelatlonB between gene 
expression patterns and patient age, serum PSA, and OS preated aft a 
continuous varable). Pearson coefficients were also used to measure the 
same correlations in data sets where the sample label of each variable tested 
(age, PSA, or GS) was randomly permuted 10,000 tknes. P-vakiee weits 



Mn assigned based upon the frequency with whkft Pearson ooefficlents 
generated from randomly psnmited data exoeadad flnso generated from 
the observed data. 

Independent prostate expivSftloi) data used for vaidfllion 

OligonuOleotkie array-based expieeelon data (AfiyniBtrix HumSSAvZ) and 
cfinioal data for 8 nornial and 27 pivattfte tumors were provkM Q. Hampton 
as a validation set (Welsh et al.. 2001). Gk)bal differences between the kirtiat 
and vacation data sets were quantified by determining the means of the 
mean array intensities. VaKdation of the tumor normal predk:t'»on nwdels 
and of Ifve ooexpresslon obsen^ for the genes highly oonelatod to GS was 
perfonned as described in the Si4>|ilemental experimental procedures (see 
Supplemental data, bekiw). 

Supplemental data 

Supplemental experimental prooedu^es and Figures S1^ can be found at 
http://www.canoeroelLorg/bgi/<^Uent/fult/1/2/203/DC1 and at http:/Atfww- 
genome.wi.mlt^uMPR/t>rpstate. 
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